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How Far Can You Get By Combining Change Detection 

Algorithms? 

Simone Bianco, Gianluigi Ciocca, and Raimondo Schettini 


Abstract —In this paper we investigate how state-of-the-art change 
detection algorithms can be combined and used to create a more 
robust change algorithm leveraging their individual peculiarities. We 
exploited Genetic Programming (GP) to automatically select the best 
algorithms, combine them in different ways, and perform the most 
suitable post-processing operations on the outputs of the algorithms. In 
particular, algorithms’ combination and post-processing operations are 
achieved with unary, binary and n-ary functions embedded into the GP 
framework. Using different experimental settings for combining existing 
algorithms we obtained different GP solutions that we termed IUTIS (/// 
Unity There Is Strength). These solutions are then compared against state- 
of-the-art change detection algorithms on the video sequences and ground 
truth annotations of the ChandeDetection.net (CDNET 2014) challenge. 
Results demonstrate that using GP, our solutions are able to outperform 
all the considered single state-of-the-art change detection algorithms, as 
well as other combination strategies. 

Index Terms —Change detection, algorithm combining and selection, 
genetic programming, CDNET. 

I. Introduction 

ANY computer vision applications require the detection of 
changes within video streams. For example, video surveillance 
applications need to track moving objects or identify abandoned ones 
to trigger event-related actions [1] by analyzing and monitoring the 
video content. Other applications that need video change detection 
are smart environments and video indexing and retrieval. For all these 
applications, a robust change detection algorithms with low false 
alarm rates is required as a pre-processing step. In the last decades, 
many algorithms have been proposed to solve the problem of video 
change detection 

The simplest strategy to detect foreground regions in video is to 
directly subtract the pixels in the current frame from those in a 
previous or reference one ( 7 ). Clearly this approach, although very 
efficient is prone to many false positive errors due to video noise 
and global illumination changes. To limit these issues, temporal or 
adaptive filters can be applied to build the background model. For 
example median filter (e.g. |8]-| 10 ], Kalman filter (TTJ-JT3J, and a 
simplified version of Kalman filter called Wiener filter 10 have been 
applied. Pixel’s values can also be analyzed in a given time slot using 
color histograms and considering the mode 0 

Another way to represent statistically the background is to consider 
the history over time of the values of the pixels. For example, 
the background can be modeled as a single Gaussian 0 or a 
Mixture of Gaussians 0- The latter overcomes the limitation of 
the unimodal model that cannot handle dynamic background motion. 
The approaches using the Gaussian model can be also extended 
by incorporating the generalized Gaussian model (T8j, (19) . Also 
Bayesian approaches have been proposed to cope with backgrounds 
having large variations (20) , 0i- For example, in (22) , is proposed 
a Bayesian framework that incorporates spectral, spatial, and tempo¬ 
ral features to characterize the background appearance in complex 
environments, while in ||23j the model uses spatial statistics of the 
neighboring pixel values to robustly detect the foreground against 
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object’s shadows. An hybrid moving object detection system that 
uses motion, change, and appearance information for more reliable 
detections is presented in (24). The method called Flux Tensor with 
Split Gaussian models (FTSG) uses a split Gaussian method to 
separately model foreground and background. 

The above statistical methods require the definition of the param¬ 
eter’s model. Non parametric methods directly rely on the observed 
data to statistically model the background (25) . Although these 
methods can deal with fast changes in the background, they are time 
consuming, and have an high memory requirement. Improvements 
have been proposed to overcome these problems e.g. |26)-(28). 

Sample consensus is another non parametric strategy used to 
model background pixels. It has been recently used in ViBe (29) 
and SuBSENSE (30) , (3l) . Sample consensus determines if a given 
observation should be considered foreground or background based on 
its similarity to recently observed samples. The two works differ in 
the features used in describing and in that SuBSENSE introduces a 
feedback mechanism to continuously improving the pixel’s modeling. 
In order to reduce the number of samples to be used to model 
the background, (32) exploits a small set of adaptive background 
templates. The templates are automatically discarded based on their 
estimated efficacy and new templates are added in their stead. 

Subspace learning is another family of background modeling 
strategy. The frame image is considered as a whole and, by taking 
into account spatial information, they are more robust to illumination 
changes. For example, Oliver et al. (33) proposed an eigenbackground 
model. A set of images are used to build a vectorization representation 
of the scene within a given time frame. This representation is then 
decomposed via Principal Component Analysis to determine the 
background via the most descriptive eigenvectors. Using a subset 
of eigenvectors makes the background more robust to illumination 
changes. The need to efficiently update the background model within 
the video sequence have generated many variants such as Incremental 
(34) , Incremental Non-Negative Matrix Factorization (35) and robust 
Matrix Factorization (36] |. 

Other methods try to learn information about background or fore¬ 
ground by using some machine learning technique. These techniques 
conveniently incorporate domain knowledge from available samples. 
For example, in (37), the normalized optical flow and normalized 
frame differences are used with a probabilistic SVM to build a 
background block classifier. In (38) , color, gradient and Haar-like 
features are integrated to handle spatio-temporal variations for each 
pixel within a Kernel Density Approximation framework, while 
background subtraction is performed using SVM. To overcome the 
problem of providing large sets of positive and negative examples 
to the learner, (39) proposes a 1-class SVM method that is able to 
update the classifier’s parameters online. 

Also, neural network-based solutions have received considerable 
attention. For example, the background segmentation approach pro¬ 
posed by (40) relies on a Probabilistic Neural Networks combining 
a neural network for background modeling and a Bayesian classifier 
for pixel’s foreground/background detection. In (41) the SC_SOBS 
algorithm models the background with the weights of a neural 
network. A modified version of the algorithm is proposed in |42| 
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where the neural network is used to specifically detect moving object 
for PTZ cameras. A weightless neural network is proposed in ED 
The approach is called CwisarDH and uses a buffer of pixel values 
to store previous foreground values in order to make the algorithm 
more robust against intermittent objects. 

All the above mentioned approaches, based their analysis on visual 
features computed on the video frames, either at pixel’s level or at 
higher resolution level. A physics-based change detection approach is 
proposed in (43) . It uses image formation models to computationally 
estimate a consistent physics-based color descriptor of the spectral 
reflectance of foreground and background surfaces. 

Regardless of the rationale of the approach, any background 
subtraction technique goals is to segment the scene into foreground 
and background components while trying to cope with the challenges 
that can be found in a real-world videos such as high variation 
in environments conditions, illumination changes, shadows, camera- 
induced distortions and so on. The output of a background subtraction 
algorithm is thus generally noisy, with isolated pixels, holes, and 
jagged boundaries. Post-processing of the foreground component, 
ranging from simple noise removal to complex object-level tech¬ 
niques, has been investigated to improve the algorithm’s accuracy. 
Results indicate that significant improvement in performance are 
possible if a specific post-processing algorithm and the corresponding 
parameters are set appropriately (44) . To cope with the variability 
of real-world videos, algorithms are becoming increasingly complex 
and thus computationally expensive. Parallelization of background 
subtraction algorithms on GPU is a possible way to make them usable 
in real time applications (45)— ED- 

Notwithstanding the improvements, change detection algorithms 
have been demonstrated to perform well on some types of videos but 
there is no single algorithm that is able to tackle all the challenges in a 
robust way. This can be clearly seen in the ChangeDetection.net (CD- 
NET) 2014 competition (48) , (49) (which follows the CDNET 2012 
competition (50) ) where change detection algorithms are evaluated on 
a common dataset composed of different types of videos sequences 
and classified according to their performance. As a consequence a 
combination of multiple change detection algorithms can considered 
in order to achieve better overall results. 

Several attempts to combine the outputs of different change de¬ 
tection algorithms have been investigated. For example the creators 
of the CDNET challenge have evaluated the performances of the 
top-3, top-5 and all the 28 reviewed using a simple majority vote 
fusion strategy (49) . The first two fusion schemes obtained the best 
results in the seven performances measure with respect to the top- 
ranked algorithm and even with respect to the fusion of all the 28 
tested algorithms. Also in ED a fusion scheme using majority vote 
is explored. In this case, the results of 22 algorithms have been fused 
as well as subsets of 3, 5 and 7 methods. Results show that the 
combination of different algorithms perform better with respect to 
single ones. 

The problem of combining the change detection algorithms is sim¬ 
ilar to the problem of classifier combining or fusion. By considering 
the output masks generated by the algorithms as responses of pixel- 
based classifiers, the problem can be seen as selection and combina¬ 
tion of, in our case, binary classifiers (foreground vs. background). 
Several works studied the problem of classifier combining or fusion 
(e.g. (52)-(57)) 

Also in the context of image segmentation, the fusion of different 
algorithm’s outputs is often exploited to obtain a more robust seg¬ 
mentation algorithms. For example, in (58) , several fusion rules are 
investigated to improve segmentation accuracy. Specifically, the Me¬ 
dian, Mean, Product, Minimun, and Maximum rules are considered, 
with the mean rule obtaining the overall best performances on the 


segmentation datasets considered. 

If the outputs of the classifier can be expressed as posteriori 
probabilities, a Bayesian methodology can be used to integrate the 
belief measure associated with each classifier to provide a combined 
final belief (59) . For example in [60], (6l] | the STAPLE algorithm 
uses an Expected Maximization strategy for estimating the “ground 
truth ’ segmentation from a group of expert segmentations in the con¬ 
text of medical imaging. STAPLE takes different segmentation and 
simultaneously estimates the final segmentation and the sensitivity 
and specificity parameters characterizing the performance of each 
expert. A similar approach is also is used in (62) to estimate the final 
segmentation from atlas-based segmentations of three-dimensional 
con-focal microscopy images of bee brains. In (63) the PRIF fusion 
scheme is presented. This scheme is based on a Markovian Bayesian 
fusion procedure, and the fusion is guided by the Probabilistic Rand 
Index (64)). This index measures the agreement of one segmentation 
result to multiple ground-truth segmentations, in a quantitative and 
perceptual way. 

Recently (65) formalizes the problem of image segmentation fusion 
as a combinatorial optimization problem in terms of information 
theory. To reduce the computational complexity required with respect 
to previous optimization techniques a generative Bayesian image 
segmentation fusion model (BISF) is proposed. 

In this paper we investigate how to create a new change detection 
algorithm by combining existing ones in a smart way. We aim at 
obtaining a new algorithm which is robust for different computer 
vision applications. To build our change detection algorithm from the 
results of existing ones, we rely on Genetic Programming (66). As 
input we feed it the set of the binary foreground/background masks 
that correspond to the outputs of the change detection algorithms, 
and a set of unary, binary, and n-ary functions to perform both 
mask’s combination (e.g. logical AND, logical OR) as well as mask’s 
post-processing (e.g. filter operators). The solution tree obtained by 
Genetic Programming will give our change detection algorithm. 

The advantage of using Genetic Programming is threefold. First, 
we are able to automatically select the algorithms that gives the best 
overall results over a set of predefined ones. Second, how to combine 
algorithms with which other ones to generate intermediate masks is 
automatically deduced. Third, which kind of post-processing of the 
original or intermediate masks to be applied in order to improve 
the results, is automatically built from the unary, binary and n-ary 
functions. 

The organization of the paper is as follows. Section [IT] illustrates 
how Genetic Programming is used to generate the combined change 
detection algorithm. In Section |III-A| we describe the experimental 
setup used in the evaluation of the proposed solution. Results are 
reported and discussed in Sections |ilI-B| Finally Section|TV|concludes 
the paper. 

II. Genetic Programming-Based combination 

Since there is no clear procedure to obtain a robust change 
detection algorithm by combining existing ones, a possible solution 
could be that of designing it by a trial-and-error process as done 
in (4£) and ED where different algorithms selections are tested. 
Instead, we propose an approach to automatically determine the best 
selection and combination of algorithms using Genetic Programming 
(GP) (66). 

The major difference between GP and the other evolutionary 
algorithms is that GP is a domain-independent evolutionary method 
that genetically breeds a population of functions, or more generally, 
computer programs to solve a problem. In our case the solutions 
correspond to fusion strategies. 
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Algorithm 1: GP 


1 begin 


n 

12 

13 

14 

15 


Generate a population P composed of an even number N 
of individuals ; 

Generation <— 0 ; 

repeat 

Calculate the fitness / of all the individuals in 
population P ; 

Create a new empty population P' ; 

repeat 

Select two individuals Ci ,Cj G P using the chosen 
selection algorithm ; 

Perform the crossover between Ci and Cj with 
probability p c , and let Ci anc/C/ be the offspring. If 
crossover is not applied, let Ci = Ci and Cj = Cj; 
Mutate each character of C % and Cj with certain 
probability p m , and let Ci and Cj be the offspring; 
Insert Ci and Cj into population P' ; 
until until population P' is composed of exactly N 
individuals’. 

Perform the copy P £- P' and delete P ; 

Generation <— Generation + 1 ; 
until a termination condition is satisfied ; 


The solutions can be represented as trees, lines of code, expressions 
in prefix or postfix notations, strings of variable length, etc. We 
use the representation first introduced in |[66j : potential solutions are 
represented as LISP-like tree structures built using a set of terminal 
symbols T and a set of nonterminal or functional symbols T. The 
iterative process of a generic GP is given in Algorithm [T] 

Before running GP, we need to set a number of parameters, which 
are as follows: 

— The sets T and T of functional (or nonterminal) and terminal 
symbols that are used to build the potential solutions. 

— The fitness function /(•). 

— The population size N. 

— The maximum size of the individuals, typically expressed as the 
maximum number of tree nodes or the maximum tree depth. 

— The maximum number of generations. 

— The algorithm used to initialize the population. A set of initial¬ 
ization algorithms can be found in (66)). 

— The selection algorithm. 

— The crossover rate. 

— The mutation rate. 

— Presence or absence of elitism (i.e. preserving unaltered the best 
solutions to the next iteration). 

Given a set of n change detection algorithms C = {Ci}£= ls the 
solutions evolved by GP are built using the set of functional (or 
nonterminal) symbols T and the set of terminal symbols T — C. 
The functional symbols corresponds to operations performed on the 
inputs. We explicitly incorporated into the GP framework the list of 
operations given in Table |T| along with their functional symbols. They 
operate in the spatial neighborhood of the image pixel, or combine 
(stack) the information at the same pixel location but across different 
change detection algorithms. 

We define the fitness function used in GP taking inspiration from 
the CDNET website, where change detection algorithms are evaluated 
using different performance measures and ranked accordingly. Given 
a set of video sequences V = {Vi,..., Vs}, a set of performance 


TABLE I 

The set of functional symbols used in GP and their 

CORRESPONDING OPERATORS. 


Function 

Inputs 

Domain 

Effect 

ERO 

1 

spatial 

Morphological erosion with a 3 x 

3 square structuring element 

DIL 

1 

spatial 

Morphological dilation with a 
3x3 square structuring element 

MF 

1 

spatial 

Median filter with a 5 X 5 kernel 

OR 

2 

stack 

Logical OR operation 

AND 

2 

stack 

Logical AND operation 

MV 

>2 

stack 

Majority vote 


measures M — {mi, ..., trm } the fitness function of a candidate 
solution Co, /(Co) is defined as the average rank across video 
sequences and performance measures: 

1 M / 

f(Co) = M E ( rankc o (A; {mj(C k (V))} n k=1 ) + 

3 = 1 ' 

+ X>i*(Co)) (1) 

i =1 ' 

where rankc 0 (-) computes the rank of the candidate solution Co 
with respect to the set of algorithms C according to the measure 
vrij. Pi (Co) is defined as the signed distance between the candidate 
solution Co and the best algorithm in C according to the measure 

m j : 


Pi (Co) 


' —rrij(Co(V)) + max rrij(Ck(V)) 
c k ec 

if the higher mj the better 

rrij(C 0 (V)) - min TOj(C fc (V)) 

C k £C 

if the lower mj the better 


( 2 ) 


and P2(Co) is a penalty term corresponding to the number of 
different algorithms selected for the candidate solution Co: 


P 2 (C 0 ) 


# of algorithms selected in Co 
# of algorithms in C 


( 3 ) 


The role of Pi is to produce a fitness function /(Co) E K, so 
that in case of candidate solutions having the same average rank, 
the one having better performance measures is considered a fitter 
individual in GP. The penalty term P 2 is used to force GP to select 
a small number of algorithms in C to build the candidate solutions. 
The relative importance of Pi and P 2 is independently regulated by 
the weights w\ and W 2 respectively. 

Our proposed combination strategy is able to simultaneously 
achieve three goals: algorithm selection, combination and processing. 
The set of functional symbols in Table [T] act both as aggregation 
functions for the combination, as well as image processing functions 
(specifically the local functions such as the morphological ones 
and the median filter). Moreover, the nature of the GP algorithm 
coupled with the penalty factor P2, allow us to perform automatic 
algorithm selection in a seamless manner during the generation of the 
intermediate solutions. This is an advantage of our proposed strategy 
compared to other combination algorithms where the selection has to 
be performed in advance. 


III. Combining state-of-the-art algorithms 
In this section we describe the experimental setup used in com¬ 
bining state-of-the art algorithms and the results. We based our 
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experiments on the CDNET 2014 challenge (48) that has received 
great attention in the evaluation of change detection algorithms. It 
provides a set of video sequences of various categories that are can 
be used to test the algorithms on different environment conditions. 
Moreover, it provides an evaluation protocol that can be used to 
compare the performances of change detection algorithms against 
each other. 

A. Experimental setup 

As the set of algorithms to be combined we considered the 
top ranked ones that have been evaluated within the CDNET 
2014 challenge as of July 2014. Table [TT] shows the top 10 
algorithms listed according to their average ranking across video 
categories. The outputs of the various algorithms are available on 
the website allowing us to perform our combination experiment 
on “ certified ” data. In our experiments, we selected a set of 
the top ranked algorithms, executed the GP algorithm using a 
training set of video sequences of the CDNET 2014 challenge, 
and then evaluated the obtained solution on the remaining 
video sequences. In particular, we created the GP combination 
algorithm using the shortest video sequences, one for each 
category. Specifically, the sequences used for training are (cate¬ 
gory/sequence): baseline/pedestrians, dynamicBackground/canoe, 
cameraJitter/badminton, intermittentObj ectMotion/parking, 

shadow/peoplelnShade, thermal/park, badWeather/wetSnow, 
lowFramerate/tramCrossroad_l fps, nightVideos/winterStreet, 

PTZ/zoomlnZoomOut, and turbulence/turbulence3. 

As for performance measures we computed them using the frame¬ 
work of the challenge that implements the following seven different 
measures: recall, specificity, false positive ratio (FPR), false negative 
ratio (FNR), percentage of wrong classifications (PWC), precision, 
and f-measure. A ranking of the tested algorithms is also computed 
starting from the partial ranks on these measures. 

We evaluate our proposed combining strategy against three dif¬ 
ferent fusion algorithms: a simple majority vote schema (MV), the 
STAPLE method 60 , [611 (STAPLE), and a probabilistic rand index- 
based algorithm 63 (PRIF). Following the challenge’s rules, each 
algorithm uses a single set of parameters. The STAPLE algorithm 
estimates its parameter on-line (i.e. the sensitivities and specificities 
of each expert) while for the PRIF algorithm we used the default 
parameters. The parameters used to initialize the GP algorithm are 
reported in Table m 

B. Results 

We applied GP to different sets of of algorithms in C using the 
top n algorithms in algorithms in Table [H| with n £ {3, 5, 7, 9}. The 
resulting algorithms, that we named IUTIS-3 and IUTIS-5 (quoting 
the Greek fabulists Aesop (620 BC-560 BC): “In Unity There Is 
Strength ”), are shown in Figure [2] and Figure [3] respectively. In the 
same figures, for each solution tree, an example of the output at each 
node on a sample frame is also reported. From the solution trees it 
is possible to notice that IUTIS-3 automatically created MV-3 in its 
right branch. IUTIS-5, instead, automatically created as part of the 
solution MV-3 and MV-5 in its left and right branch respectively. 
It is worth nothing that thanks to the filtering and post-processing 
operations automatically selected by GP, the average rank of IUTIS-3 
and IUTIS-5 are significantly lower than MV-3 and MV-5. In fact the 
average rank of IUTIS-3 is 3.41 points lower that that of MV-3, while 
IUTIS-5 is 4.42 points lower that that of MV-5. 

The GP solutions with n < 5 (i.e. IUTIS-3 and IUTIS-5) used all 
the algorithms available in C. The solutions obtained with n > 5 
are not reported since in these cases the GP algorithm found a 


solution identical to IUTIS-5. This is an evidence that GP is able 
to automatically select the best set of algorithms. This is one of 
the major difference between our method and the other fusion-based 
algorithms considered, which are not able to perform automatic 
algorithm selection, and thus use all the given algorithms. The effect 
is evident in Figure [T] where the average rank and F-Measure of 
the fusion-based algorithms on the CDNET 2014 challenge video 
sequences are reported: it is possible to see how the performance of 
PRIF and STAPLE decrease by increasing the number of algorithms 
available. MV-5 instead outperforms MV-3, but also in this case the 
performance decrease for n = 7 and 9. A different trend can be 
observed for IUTIS, which being able to perform automatic algorithm 
selection, is able to remove from the final solution those algorithms 
that could degrade the performance. The complete comparison of our 
proposed solutions to the single algorithms of Table [IT] and fusion- 
based algorithms in the state-of-the-art in terms of ranking on the 
CDNET 2014 challenge video categories is reported in Table [TV] It 
is possible to notice that all fusion-based algorithms with n < 5 
outperform all the single algorithm, while this is true only for IUTIS 
and PRIF for higher values of n. In particular we can observe that 
there are three categories on which IUTIS-5 is not ranked first, i.e.: 
Turbulence, Dynamic Background and Intermittent Object Motion. 

Outputs of some of the tested algorithms on sample frames in 
the CDNET 2014 dataset are shown in Figure [5] together with input 
images and ground truth masks. Detailed evaluation results of the 
IUTIS-3 and IUTIS-5 algorithms in terms of all the seven different 
measures for each category of the evaluation dataset are reported in 
Table [V] and [Vi] respectively. 

Since the proposed algorithm is stochastic by nature, we report in 
Figure [4] two variant solutions of found in different runs by our GP 
fusion scheme using the top 3 algorithms in Table [Tl] These solutions 
trees are similar to IUTIS-3 and have the same overall results. With 
respect to IUTIS-3 they have introns, i.e. they possess non-functional 
branches: the sequence of erosions and dilations in the left branch 
of the top tree in Figure [4] is equivalent to a single erosion; in the 
bottom tree, the top OR operation is uninfluential on the final result. 

Finally, Table |VII| reports the complete official ranking on the 
CDNET 2014 website at the moment of the submission (Nov 5 2015). 
As it can be seen, our solutions both outperform all the evaluated 
change detection algorithms including the most recent ones. 

IV. Conclusion 

In this paper we have presented an evolutionary approach, based 
on Genetic Programming, to combine change detection algorithms to 
create a more robust algorithm. The solutions provided by Genetic 
Programming allow us to automatically select the best subset of the 
input algorithms. This is one of the major difference between our 
method and the other fusion-based algorithms considered, which are 
not able to perform automatic algorithm selection, and thus use all 
the given algorithms. Moreover, we are able to automatically combine 
them in different ways, and perform post-processing on their outputs 
using unary, binary and n-ary operators embedded into the Genetic 
Programming framework. 

We have shown that applying our approach on the best algorithms 
in the state-of-the-art, we are able to create the IUTIS-5 algorithm 
that ranks first by a large margin among a total of 29 change detection 
algorithms on the CDNET 2014. 

As a future work we plan to investigate if the same approach, 
applied on very simple change detection algorithms, is able to 
create solutions with comparable results to more complex algorithm. 
Moreover, it would be interesting to use our framework to create new 
algorithms from scratch using atomic image processing operators as 
building blocks. 



TABLE II 

TOP 9 CHANGE DETECTION ALGORITHMS LISTED ON THE CDNET 2014 CHALLENGE WEBSITE (AS OF JULY 2014) RANKED BY THEIR AVERAGE 

RANKING ACROSS THE 11 VIDEO CATEGORIES. 
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Rank Method 

1 FTSG 

2 SuBSENSE 

3 CwisarDH 

4 Spectral-360 

5 AMBER 

6 KNN 

7 SC_SOBS 

8 RMoG 

9 KDE 


Description 

Flux Tensor with Split Gaussian models 

Self-Balanced SENsitivity SEgmenter 

Change Detection with Weightless Neural Networks 

Change Detection based on Spectral Reflectaces 

Extension of the Adapting Multi-resolution Background ExtractoR 

Adaptive Gaussian Mixture Model 

Spatial Coherence Self-Organizing Background Subtraction 

Region-based Mixture of Gaussians 

Change detection based on Kernel Density Estimation 


Reference 

_ 24 

31 

67 
43 

32 
26 
41 

68 
69 




# of algorithms available for the combination (n) # of algorithms available for the combination (n) 

Fig. 1. Plots of the average rank (left) and F-measure (right) for the different fusion-based algorithms considered by varying the number of algorithms 
available in in the combination, i.e. n = 3, 5, 7, 9 




Fig. 2. IUTIS-3 solution tree and its example masks. SBS, FTS, and CWS 
refer to SuBSENSE, FTSG, and CwsarDH algorithm respectively. 


TABLE III 

Set of parameters used in GP. 


Parameter name 

Setting 


Functional symbols 

T (see Sec. 

IT 


Terminal symbols 

T (see Sec. 

if 



Fitness function 

Population size 
Max tree depth 
Max number of generations 
Initialization algorithm 
Selection algorithm 
Crossover rate 
Mutation rate 
Elitism 


/(•) defined in Eq. 1 1 |with [w±,W 2 ] = 
[ 0 . 01 , 0 . 01 ] 

50 

Dynamic 

100 


Ramped half-and-half 66 


Tournament, with tournament size 5 


Adaptive 70 


Adaptive 70 
Yes 


References 

[1] M. Sedky, M. Moniri, and C. Chibelushi, “Classification of smart video 
surveillance systems for commercial applications,” in Advanced Video 
and Signal Based Surveillance, 2005. AVSS 2005. IEEE Conference on, 
2005, pp. 638-643. 

[2] R. Radke, S. Andra, O. Al-Kofahi, and B. Roysam, “Image change 
detection algorithms: a systematic survey,” Image Processing, IEEE 
Transactions on, vol. 14, no. 3, pp. 294-307, 2005. 











































TABLE IV 

Comparison of our proposed solutions to single and lusion-based algorithms in the state-of-the-art in terms of rank in each 

VIDEO CATEGORY AND AVERAGE RANK. 
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Method ID 

Avg rank 

Overall 

Bad 

Weath. 

Low 

F.rate 

Night 

Videos 

PTZ 

Turb. 

Base. 

Dynam. 

Backg. 

Camera 

Jitter 

Interm. 

Obj. M. 

Shadow 

Therm. 

IUTIS-5 

1.25 

1 

1 

1 

1 

1 

2 

1 

2 

1 

2 

1 

1 

IUTIS-7 

1.25 

1 

1 

1 

1 

1 

2 

1 

2 

1 

2 

1 

1 

IUTIS-9 

1.25 

1 

1 

1 

1 

1 

2 

1 

2 

1 

2 

1 

1 

PRIF-3 

4.58 

2 

3 

2 

3 

3 

5 

3 

9 

7 

6 

6 

6 

IUTIS-3 

4.67 

3 

2 

3 

8 

5 

9 

4 

4 

4 

8 

3 

3 

PRIF-5 

5.58 

4 

7 

6 

5 

4 

1 

10 

3 

8 

7 

8 

4 

MV-5 

5.67 

5 

9 

8 

2 

6 

10 

2 

10 

3 

3 

6 

4 

STAPLE-3 

6.00 

6 

4 

5 

4 

8 

6 

5 

5 

8 

12 

2 

7 

PRIF-7 

6.50 

8 

10 

4 

9 

2 

3 

9 

1 

6 

16 

5 

5 

MV-3 

8.08 

7 

11 

11 

6 

13 

8 

6 

12 

5 

4 

5 

9 
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TABLE VI 
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